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Abstract. Rubik's Cube is an easily-understood puzzle, which is origi- 
nally called the "magic cube" . It is a well-known planning problem, which 
has been studied for a long time. Yet many simple properties remain un- 
known. This paper studies whether modern SAT solvers are applicable to 
this puzzle. To our best knowledge, we are the first to translate Rubik's 
Cube to a SAT problem. To reduce the number of variables and clauses 
needed for the encoding, we replace a naive approach of 6 Boolean vari- 
ables to represent each color on each facelet with a new approach of 3 or 
2 Boolean variables. In order to be able to solve quickly Rubik's Cube, 
we replace the direct encoding of 18 turns with the layer encoding of 
18-subtype turns based on 6-type turns. To speed up the solving fur- 
ther, we encode some properties of two-phase algorithm as an additional 
constraint, and restrict some move sequences by adding some constraint 
clauses. Using only efficient encoding cannot solve this puzzle. For this 
reason, we improve the existing SAT solvers, and develop a new SAT 
solver based on PrecoSAT, though it is suited only for Rubik's Cube. 
The new SAT solver replaces the lookahead solving strategy with an 
ALO (at-least-one) solving strategy, and decomposes the original prob- 
lem into sub-problems. Each sub-problem is solved by PrecoSAT. The 
empirical results demonstrate both our SAT translation and new solving 
technique are efficient. Without the efficient SAT encoding and the new 
solving technique, Rubik's Cube will not be able to be solved still by 
any SAT solver. Using the improved SAT solver, we can find always a 
solution of length 20 in a reasonable time. Although our solver is slower 
than Kociemba's algorithm using lookup tables, but does not require a 
huge lookup table. 

Key words: Rubik's Cube, SAT encoding, SAT solver,Two-phase algo- 
rithm, planning, puzzle, state-transition problems. 



1 Introduction 

SAT solvers have attained success in many fields, and have been used widely 
for hardware design and verification, software verification, artificial intelligence, 
cryptanalysis, equivalence checking, model checking, planning, scheduling etc. 
However, there are still large real world instances that cannot be solved by SAT 
solvers. Rubik's Cube is such an example. It is a well-known planning problem, 
which is originally called the "magic cube". The puzzle game was invented in 
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1974[1] by Erno Rubik. So far, It has been studied for a long time. Yet many 
simple properties remain unknown. From the viewpoint of SAT applications, this 
paper studies this puzzle. 

With respect to this puzzle, one of the most natural questions is how many 
moves are required to solve Rubik's Cube in the worst case. This problem has 
been studied for over 30 years. There has been great progress. In 1995, Rcid 
proved that the lower bound on the number of moves and the upper bound is 
20 and 29, respectively [51617] . Since then, the upper bound was unceasingly im- 
proved. In 2006, the upper bound was reduced by Radu [8] to 27. In 2007, Kunkle 
and Cooperman used computer search methods to refine it to 26. In 2008, Rokicki 
|3I4) reduced further it from 25 to 22. In 2010, this open problem was settled. 
Rokicki, Kociemba, Davidson and Dethridge |9ll0j proved that God's number 
(i.e. the upper bound) for the Cube is exactly 20. They spent about 35 CPU- 
years of idle computer donated by Google to solve all 43,252,003,274,489,856,000 
positions of the Cube. Without a doubt, all the current approaches to proving 
the upper bound are time-consuming and space-consuming. How to finish the 
theoretical proof of the upper bound is yet a hard problem. 

Apart from the approach to compute directly positions of the Cube, one 
considered the other approaches. In 1985, Korf [TT] noted that problems such 
as Rubik's Cube can be divided into subgoals that are of the property called 
operator decomposability, and attempted to solve them by searching for macro- 
operators. Korf 's approach succeeded for the 2x2x2 version of Rubik's Cube, 
but failed to find an optimal solution for the full 3x3x3 Rubik's Cube, for which 
the solution lengths are close to those of human strategies. Rubik's Cube is also a 
well-known planning problem. This puzzle can be encoded as a planning problem 
in PDDL (Planning Domain Definition Language). Nevertheless, there is no 
report on solving successfully it using a sat-based planner such as SATPLAN 

The purpose of this paper is two-fold. The first purpose is to design an 
effective SAT encoding of Rubik's Cube. The second purpose is to improve the 
existing SAT solvers to extend the range of SAT applications. To attain the first 
purpose, we optimize the encoding of this puzzle in the following ways: encoding 
the At-Most-One (AMO) constraint for minimizing the number of moves by the 
2-product encoding [14] proposed recently, and replacing a naive approach of 6 
Boolean variables to represent each color on each facelet with a new approach of 
2 Boolean variables. The number of Boolean variables required by a cube state is 
cut from 8x6x6 = 288 to 8 x 6 x 3 = 144 or 8 x 6 x 2 = 96. Encoding this puzzle 
according to the idea of one-phase algorithm results in a very hard SAT problem. 
Therefore, we encode this problem according to the idea of two-phase algorithm, 
and consider the goal state of phase 1 as an additional constraint. In addition to 
efficiently encoding this puzzle, we improve the existing SAT solvers, and develop 
a new hybrid SAT solver based on PrecoSAT [13] , though the new solver is suited 
only for Rubik's Cube. The new solver selects some decision variables according 
to whether a variable occurs in a clause with the AMO constraint, and splits 
the original problem into some subproblems with those decision variables. Each 
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subproblems is solved by PrecoSAT. The empirical results demonstrate both our 
SAT translation and new solving technique are efficient. Without the efficient 
SAT encoding and the new solving technique, Rubik's Cube will not be able to 
be solved still by any SAT solver. Using the improved SAT solver, we can find 
always a solution of length 20 in a reasonable time. Although our solving speed 
is slower than the non-SAT solver such as Kociemba's algorithm using lookup 
tables, but does not require a huge lookup table, needs less memory. 10 MB 
RAM memory is sufficient for the approach to use the SAT solver. 

2 Preliminaries 
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Fig. 1. A state of Rubik's Cube 

Rubik's Cube is a 3-D mechanical cube, which consists of 27 smaller cubes 
(cubies) . The center cubies on each face and the core of Rubik's Cube forms a 
fixed frame. Other 20 cubies move around them. A full face of the larger cube 
is divided into 9 facelets, each of which is a face of a distinct cubies, where each 
face of the cubies is colored one of six colors. A state of Rubik's Cube can be 
considered as a permutation on 48 facelets, since 6 center facelets are fixed. In 
general, it may be described by six faces: "front" , "left" , "back" , "right" , "up" 
and "down" face, each with 3x3 facelets. Figure 1 presents a sate (position) of 
Rubik's Cube. A state of Rubik's Cube is said to be the home state (position) 
or solved state (position) if all facelets of each face in that state are the same 
color. Solving Rubik's Cube means restoring from a scrambled state into the 
home state. 

A move on Rubik's Cube refers to rotating the nine cubies on a face as a 
group 90 or 180 degrees around a central axis. We use the 'face turn metric' to 
compute the number of moves required to solve Rubik's Cube. That is, a single 
move is considered as a turn of any face, 90 or 180 degrees in any direction. 

Rubik's Cube has a total of eighteen different moves. These moves are con- 
ventionally denoted by U, U', U2, D, D' ', D2, L. L', L2, R, R', R2, F, F', F2, B, B' , 
and B2. Each clockwise 90 degree move is specified by just the face with no suffix, 
and each counterclockwise 90 degree move and each 180 degree move are speci- 
fied by the face followed by a prime symbol ('), and the digit 2, respectively. So U 
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here denotes a clockwise quarter turn of the "up" face, and similarly, D, L, R, F 
and B denote "down" , "left" , "right" , "front" and "back" , respectively. A solu- 
tion can be represented by a move sequence. As an example, the move sequence 
F2U2B'U'B2D'U2F'U2LDR2B2U2F'U2F'U2B2L2 is a solution to the state 
shown in Figure 1. That is, performing in turn each move in this sequence can 
restores that state to the home state. As defined in [3], we define Sis the set 
of 18 moves mentioned above, and A w = {U, U', U2, D, D', D2, L2, R2, F2, B2}, 
which will be used in subsequent sections and the two-phase algorithm given 
later. 

A Rubik's Cube consists of different cubies. By convention, the cubies are 
classified into edge cubies (two visible facelets), corner cubies (three visible 
facelets) and center cubies (one visible facelets, in the center of a side). Cor- 
respondingly, according to the cubie where a facelet belongs, the facelets are 
classified into edge facelets, corner facelets and center facelets. 

One of goals of this paper is to translate the Rubik's cube puzzle into a sat- 
isfability (SAT) problem with CNF (conjunctive normal form). A propositional 
logic formula is said to be in CNF if it is a conjunction ("and") of clauses, each 
clause being a disjunction ("ors") of Boolean literals, where each literal is either 
a variable or the negation of a variable. 

3 SAT encoding of the Rubik's Cube puzzle 

The Rubik's Cube puzzle may be described by the initial state, the move se- 
quence, the map relation of each move and the solved state. Its SAT encoding 
will consist of such ingredients. A Rubik's Cube has a total of six colors. A naive 
approach is that a color corresponds a Boolean variable. Thus, representing each 
color on each facelet requires six Boolean variables. In fact, six colors contains 
only log 6 ~ 2.6 bit information. So the number of Boolean variables can be 
reduced. Let 616263 be the binary representation of fc(0 < k < 5). The Boolean 
variable representation of the fc-th color is x\ (61), £2(62), £3(63), where Xi{bi) is 
Xi if bi is 1, and ~xl otherwise. For example, the Boolean variable representa- 
tion of the second color is xT, X2,X3- Therefore, 3 Boolean variables suffice for 
representing the color of each facelet. In our SAT encoding, states are divided 
into two categories: general state and iJ-state. A state is said to be £f-state 
if it can be transformed into the solved state by a sequence of the moves in 
A10 mentioned above. In the two-phase algorithm, each state in Phase two is 
-ff-state. For general states, we represent each color on each facelet with three 
Boolean variables. For H-states, we represent each color on each facelet with two 
Boolean variables. In the 2-variable scheme, we represent the colors of the front, 
left, back, right face in the solved state by 00, 01, 10 and 11, respectively, and 
then re-use 00 and 01 to represent the colors of the other two (top and down) 
faces. Notice, any move in in A\q cannot transform any facelet on top and down 
faces to somewhere on the other four faces. iJ-states are allowed to use only 
moves in Aiq. Therefore, under instates, the 2-variable scheme does not yield 
any confusing. 



5 



Let c(i, j, m) be the color of the j-th facelet in the i-the face under the m-th 
(to > 1) state (hereafter, color of facelet (i,j,m), for short), c(i, 4, 1) the center 
facelet color of the i-th face under the initial state. If the m-th state is the solved 
state, this state may be represented by 
A c(i,j,m) =c(i, 4,1) 

l<j<6,0<j'<8 

Using 3-variable scheme, c(i,j,m) — c(i,4, 1) is translated into c(i,j, to, 1) = 
c(i,4, 1, 1) A c(i,j, to, 2) = c(i, 4, 1, 2) A c(i, j, m, 3) = c(i, 4, 1, 3), where c(. . . 1), 
c(. . . 2) and c(. . . 3) are literals that denote the 1st, 2nd and 3rd bit of a color. 
Formula c(i,j, to, 1) = c(i,4, 1,1) can be translated into the following clauses: 
(c(i, j, to, 1) V -.c(i, 4, 1, 1)) A (-.c(i, j, to, 1) V c(i, 4, 1, 1)). 

An initial state of a cube is considered as State 1, which is interpreted as 
A B(c(i,j,l,k)) 

l<i<6,0<i<8,fe=l,2,3 

where B(c(i,j, 1, k)) is defined as c(i, j, 1, k) if the value of the fc-th bit color of 
facelet(i, j, 1) is 1, and ->c(i,j, 1, k) otherwise. 

Assume we take at most n — 1 moves to solve Rubik's Cube, and associate 
a Boolean variable st with each state t(l < t <n). " St — true " mean the t-th 
state is the solved state. Then, this constraint can be represented by 
A (-*iVc(t,j,t) = <:(», 4,1)) 

l<j<6,0<j'<8 

This formula can be converted easily into clauses. 

At any time, among S = {si, s 2 , . . . , s„}, we must ensure that exactly one St 
is true. The exactly-one constraint can be formalized by the at-least-one (ALO) 
and at-most-one (AMO) constraint. That is, exactly-one(S') = ALO (5)A AMO 
(S). The standard SAT encodings of constraints ALO and AMO are the follow- 
ing. 

ALO(S) = si V s 2 V ■ ■ ■ V s n 

AMO(5") ee {s?V s7|s 4 , Sj - G S,i <j} 
The ALO constraint ensures that a variable is true. And the AMO constraint 
ensures that no more than one variable is true. The standard AMO encoding 
requires much more clauses. To reduce the number of clauses, we can apply a 
two-product AMO encoding [Tl], which is recursively defined as 

l<k<n,k=(i-l)q+j 

AMO(S') ee AMO(t/) A AMO(V) A ((3*V m) A (ajf V v s )) 

l<i<p,l<j<q 

where p = \y/n},q = \j],U = u 2) ■ • • , u p }, V = {wi, v 2 , • . . , v q }, each ele- 
ment Ui in U and each element Vj in V are auxiliary variables. Here, AMO(J7) 
and AMO(V) apply the standard AMO encoding. The number of clauses and 
auxiliary variables required by this encoding are 2n + p(p — l)/2 + q(q — l)/2 
and p + q, respectively. When n = 20, the number of clauses required is 40 + 
4(4 — l)/2 + 5(5 — l)/2 = 56. For the n, the standard AMO encoding requires 
20(20-l)/2=190 clauses. 

To encode efficiently the constraints on the turns, we classify the turns of 
Rubik's Cube into six classes: u, d, I, r, f and b. Let Ufc(l < k < n) be a Boolean 
variable that is associated with the up turn of step k. We perform either U-, or 
U'- or f/2-type up turn at step k when Uk is true, and do the other turn otherwise. 
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The meaning of dk,lk,Tk, fk an d b k is similar. At any step, we have a unique 
turn. This constraint can be formalized by exactly-one(«fc, dk, h, r k, fk, bk) for 
1 < fc < n. Each u k corresponds actually three different turn: U,U',U2. We 
denote the U,U',U2 of step k by Boolean variables Uk,U' k ,Uk2. Clearly, these 
Boolean variables should satisfy ~^u k V exactly-one ([/fc, U' k , Uk2). Similarly, we 
have the following constraint conditions: 

-idfcV exactly-one(D/c, D' k , D k 2) 

^l k V exactly-onc(i fe , L' k , L k 2) 

~^r k y exactly-one ( Rk, R' k , Rk 2) 

->/feV exactly-one(Ffe,F^,Ffe2) and 

^6fcV exactly-one (Bfc, B' k , B k 2). 

A move can be considered as a mapping that maps each facelet c(i,j, k)(l < 
i < 6, < j < 8, 1 < k < n) at State Ho a facelet c(i',j',k — 1) at State 
k — 1. Let Mfe be a Boolean variable denoting one of 18 different moves at step 
k. The corresponding mapping is denoted by ju k - Then we have the following 
constraint condition. 

-nM fe V A c(i, j, fc) = / Mfc (c(i, j, fc)) 

l<i<6,0<j<8 

For the clockwise up turn Uk, the mapping relationship of fu k : c(i,j,k) — > 
c(i',j', k — 1) is the following. 

c(i mod 4 + 1, 0, fc) = c(i, 0, fc - 1), c(i mod 4 + 1, 1, fc) = c(i, l,k— 1), 

c(i mod 4 + 1, 2, fc) = c(i, 2, fc - 1) for 1 < i < 4 

c(5, 0, fc) = c(5, 6, fc - 1), c(5, 1, fc) = c(5, 3, fc - 1), 

c(5, 2, fc) = c(5, 0, fc - 1), c(5, 3, fc) = c(5, 7, fc - 1), 

c(5, 5, fc) = c(5, 1, fc - 1), c(5, 6, fc) = c(5, 8, fc - 1), 

c(5, 7, fc) = c(5, 5, fc - 1), c(5, 8, fc) = c(5, 2, fc - 1) 
The other facelets keep unchanged. 

It is easy to sec that any of the moves will move exactly 20 facelets, and pre- 
serve the other facelets. Furthermore, in the same type of moves, the unchanged 
facelets are the same. For example, for U, U', U2, they all preserve the facelets 
3-8 in faces 1-4, and all the facelets of face 6. To save the number of clauses, we 
split the above constraint condition into two parts: changed and unchanged. We 
use Boolean variables denoting the same type of moves to control the unchanged 
part. The changed part is controlled by Boolean variables denoting concrete 
moves. For the move U of step fc, we have the following constraint conditions. 

^MfeV A c(6,j,fc) = c(6,j,fc- 1) A c(i,j,k) = c(i,j,k- 1) 

0<j<8 l<i<4,3<j<8 

-t/ fe V A c(6,j,fc)=c(6,/(j),fc-l) A c(g(i),j,k) = c(i,j,k-l) 

0<j<8 l<i<4,0<j<2 

where f(j) is the mapping as shown above, and g(i) = i mod 4 + 1. For the 
move U' (U2) of step fc, the first condition above will share. The second condition 
needs to re-construct, but is easy. This can be done by replacing Uk in the second 
condition above with U' k ([4 2), and defining the corresponding /(j) and g(i). 
For the other moves such as D, F, B, L, R, similar constraint conditions are easily 
constructed. The optimization technique given above can reduce the number of 
clauses by above 1/3. 
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Some two-move sequences will yield the same result. For example, two-moves 
UD and DU have the same result states. To speed up the search, we remove 
the search on two-move sequences such as DU. The removing of such a search 
can be done by adding the following constraint clauses to the SAT encoding of 
Rubik's Cube. 

A (0"fc V dk+i) A (Ife V Tvk) A (/fe V 6fc+i)) 

l<fe<n 

4 Encoding Kociemba's algorithm and other constraints 

Using the SAT encoding given in the previous section, a modern SAT solver 
can find only solutions of length at most 13. However, many states have already 
been shown that requires 20 moves (e.g. supcrflip). To find such a solution, we 
add other tricks. A useful trick is to add the encoding of Kociemba's algorithm, 
which is a two-phase algorithm. The basic idea of the algorithm is to splits 
the problem into two almost equal subproblems, each of which can use a lookup 
table to search for exhaustively a solution. Here is the pseudo-code of Kociemba's 
algorithm. 

Kociemba's Algorithm 

d <- 0, t <- oo 
while d < t do 

for s e Sf 8 , ps e H do 
if d + D(ps) < t then 

find a better solution, using moves in A w 
t <- d + D(ps) 
end if 
end for 
d <- d+ 1 
end while 

This algorithm assumes that the original state is p, and applies some move 
sequence s € Sf 8 (for the definition of Sis, see Section 2) of length d to the 
original cube yielding ps which lies in H. This search process is called phase 
one. Here H is a subset of states that is composed of all patterns with following 
characteristics: 

1. The orientation of all corner cubies and edge cubies is correct. 

2. The edge cubies that should be in the middle layer are now located in the 
middle layer. 

These characteristics are preserved by moves in the set Am (which is defined 
in Section 2). The search process from the new state ps to the fully solved state is 
called Phase two. In this phase, each move is in A w . D(ps) returns the distance 
from the state ps to the solved state using moves in Am. To efficiently complete 
this computation, it is usually done by a lookup table. In fact, Phase one is 
usually also done by a lookup table. 
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It is impossible to encode directly the entire Kociemba's algorithm into a SAT 
problem, because it contains lookup tables. However, it is possible to encode the 
basic idea of Kociemba's algorithm with a CNF formula. Let Cube_CNF(n) 
denote a SAT encoding of Rubik's Cube with a total of n states, which can 
easily be done by the approach given in Section 3. Assume that the A:-th state 
Sfc reaches a state in H . A SAT encoding of Rubik's Cube containing the basic 
idea of Kociemba's algorithm can be described by the following formula: 

Cube_CNF(n) A Hstate(s fc ) A ^i -move(A:, n) 
where Hstate(sfc) is true if Sk is in H, and j4io_move(fc, n) is used to restrict 
moves from step k to step n to be moves in Aiq. Hstate(sfe) is defined as 
A (c(i,j,k) = c(i,j, 4) Vc(p(i),j,k) =c(i,j, 4))A 

l<i<4j=3,5 

A {c(i,j,k) = c(i,j,4)V c(p{i),j,k) =c(ij,4)) 

i=5,6A0<j"<8 

where p(i) denotes the opposite face of the i-th face, i.e., the mapping relation- 
ship of p is: 1 f> 3, 2 « 4, 5 f> 6. 

Based on the definition of Aiq, L, L', R, R', F, F', B and B' all are not in Aiq. 
So Aio_move(fc, n) may be described by the following logic formula. 
A ->(L m V L' m V R m V R' m V F m V F» m VB m V B' m ) 

k<m<n 

This means that after step k, neither clockwise nor counter clockwise 90 
degree turn of any face except for the up and down face is allowed. 

The encoding of Rubik's Cube containing the above two constraints can be 
considered as a SAT encoding of Kociemba's algorithm. Depending on different 
k, the efficiency of solution is different. In general, k is set to less than 12. That 
is, the length of Phase one is limited to 12. 

A move on a cube can change only 20 facelets, and keep the other 28 facelets 
unchanged. That is to say, the last second state from the solved state have 
28 facelets that are placed correctly. Based on this property, we encode the last 
second state as a additional constraint condition. Let t is the last turn operation. 
Then we add the following encodings: 

(^u t V unchanged28facelet(t/)) A(^d t V unchanged28facelet(D)) A 
(-i?tV unchanged28facelet(L)) A(^r t V unchanged28facelet(i?)) A 
(-./jV unchanged28facelet(i ;i )) A(^6 t V unchanged28facelet(S)). 
where unchanged28facelet(?7) can be encoded as follows. 

A c(i,j,f-l)=c(i,4,l) A c(6,j,t-l) = c(6,4,l) 

l<i<4,3<j<8 0<j<8 

the other unchanged28facelets are similar. The last second state from the final 
state of phase 1 can be encoded also in a similar way. 

5 A SAT Solver for Solving Rubik's Cube 

Based on our experimental observation, the PrecoSAT solver [13], the Gold 
Medal winners in the application category of the SAT 2009 competition, was 
the fastest on Rubik's Cube. Without any pruning strategy, it is hard to solve 
Rubik's Cube. Since PrecoSAT has no pruning strategy, it is not good choice to 
use directly PrecoSAT. To solve more efficiently Rubik's Cube, we built a new 
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solver based on PrecoSAT. The basic framework of this new solver is similar to 
MoRsat [H], but replaces the lookahead solving strategy with an ALO (at-least- 
one) solving strategy. Let the notation ^(x) denotes the resulting formula after 
assigning literal x true and performing iterative unit propagation. The basic 
idea of the ALO solving strategy is to decompose the original problem T into 
subproblems T{xi)(\ < i < n) if a clause C in J 7 is x\ V X2 V • • • V x n . If no sub- 
problem is satisfiable, the original problem is unsatisfiable. Each subproblem can 
be solved in a recursive way. Once the recursive depth reaches some constant, 
say 4, we use PrecoSAT to solve that subproblem. This ALO solving strategy 
has been applied successfully to MPhaseSAT [17]. In the new solver used for 
Rubik's Cube, the size of the clause C used to decompose the original problem 
is specified to 6. And the recursive depth of the ALO solving strategy is limited 
to 4. Under this assumption, this new solver may be described in a recursive way 
as follows. 

Algorithm SATsolver(.7 r , Zei>eZ){Initially level is set to 1} 
T <— LookaheadSimplify(J r ) 
find a clause C with 6 free variables 
if no such C was found then return PrecoSAT(J r ) 
for i — 1 to 6 do 

assume C = x\ V x-i V X3 V X4 V £5 V xq 

p <- T[xi) 

if level < 4 then 

if SATsolver(J r ', level + 1) = SAT then return satisfiable 
else if PrecoSAT (J"') = SAT then return satisfiable 
end for 

return unsatisfiable 

Procedure LookaheadSimplify corresponds multiple failed literal probes in 
PrecoSAT. It can be either a simple look-ahead or double look-ahead procedure 
in March [IS]. In some cases, this procedure can be removed. The reason why 
the size of the clause C in the above algorithm is limited to 6 is that there are 
six types of moves, and in the SAT encoding of Rubik's Cube, the clause of size 
6 is certainly obtained by encoding six types of moves. For general problems, 
the size of the clause C in the above algorithm should be as long as possible. 

6 Experimental Studies 

We solved Rubik's Cube with the SAT solver described in previous section on 
a 2.40GHz machine with Intel Core 2 Quad Q6600 CPU. Without adding the 
constraint conditions of two-phase algorithm given in Section 4, we found that 
determining whether a cube has an optimal solution with the maneuver (a move 
sequence is called a maneuver) length of 13 took about 7 hours in the worst 
case by our SAT solver. If the maneuver length of a solution is greater than 13, 
in general, no modern SAT Solver cannot find efficiently an optimal solution. 
However, if adding the constraint condition of two-phase algorithm, it is easy 
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to find a near-optimal solution. Hence, as a part of the SAT encoding of a 
cube, all the subsequent experiments assume that constraint condition of two- 
phase algorithm is add to the SAT encoding of Rubik's Cube. Table 1 presents 
the numbers of variables and clauses required to encode Rubik's Cube by our 
encoding strategy. Here, the length of a solution is limited to be 20. So all the 
numbers of clauses required are small, and are within 6700, although they have 
a little bit change for different lengths of the phase 1 maneuvers. 

Table 1. Numbers of variables and clauses required to encode Rubik's Cube 
with different lengths of the phase 1 maneuvers 



length of phase 1 


number of variable 


number of clauses 


9 


3570 


66028 


10 


3570 


66026 


11 


3594 


66201 


12 


3618 


66248 



The time to solve a cube greatly depends on the given states. The cube state 
shown in Figure 1 is an easy example. We found the following six solutions to 
this example by setting different faces to the U- and £>-face, and using different 
encoding strategies, based on two-phase algorithm. 

1. F2RDF'R'D2UL'DFL2UR2D2L2U'L2B2F2R2 

2. R 1 FRBU' B' D' LU' F'U' R2B2U' B2L2U2L2D2F2 

3. F'L'F'U2BU'F'DF2LB2R2D2F2R2FD2B'F2U2 

4. UF'RL'B2F2LBD'B'RB2LB2LD2B2L'B2U2 

5. U F' D2U2R' L2FU' F' RF2R 1 L2D2LF2D2R' B2U2 

6. UFR2LB2F2L'B'U'B'RD2B2D2B2U2L2B2R'L' 

Each solution took about 200 seconds. These solutions have a common char- 
acteristic: both the length of the phase 1 maneuvers and the length of the phase 
2 maneuvers are 10, and the total length is 20. These solutions are not the short- 
est. Finding a shorter solution will take much more time, since the length of the 
phase 1 maneuvers increases with the shortening of solutions. 

A special cube state which flips all 12 edges, called superfhp, is a hard example 
to our SAT solver. It has been proven to have a shortest maneuver length of 
20 moves to be solved. Finding a shortest solution to the superfhp is time- 
consuming. However, it is easy to find a near-optimal solution. Actually, we 
took 532 seconds to find a solution with the length of 21 moves as follows. 
BF'L'U2F2LD'U'F'R'LF2U2R2B2UR2D'B2U'R2 

If the length of the phase 1 maneuvers is known to be 13 in advance, for the 
superfhp, we can find easily a shortest solution of length 20 as follows. 
BFU2R' D'U L2B2R2BV2R' LV L2U' B2D 1 L2U' 

It took about 310 seconds. Note that this solution is different from that given 
by Cube Explorer [TB] that implements two-phase algorithm using lookup tables. 
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To test the generalized case, we generated 10 randomly cube states. For each 
state, the time required by our SAT solver to search for a solution of length 20 
or less is shown in Table 2. 

The upper bound on the length of the phase 1 maneuvers has been shown 
to 12 [7j. Our experiments verified that this fact is true. Furthermore, within 
length 12 of the phase 1 maneuvers, we found always a solution of length 20 
or less. As the length of the phase 1 maneuvers increases, the time to solve a 
cube increases sharply. In most of the cases, we can find a solution within 7000 
seconds. If the length of solutions is allowed to be 21, the time to solve a cube 
by our SAT solver never exceeds 1500 seconds. 



Table 2. Runtime took by our SAT solver to solve 10 random cube states 



cube state 


length of phase 1 


length of phase 2 


time (seconds) 


1 


10 


10 


1072 


2 


11 


9 


6785 


3 


10 


10 


137 


4 


11 


9 


6123 


5 


9 


11 


87 


6 


10 


10 


774 


7 


11 


9 


1489 


8 


10 


9 


1079 


9 


12 


8 


14096 


10 


10 


10 


329 



Table 3. Runtime took by Cube Explorer to solve 10 random cube states 



cube state 


solution length 


time (seconds) 


solution length 


time (seconds) 




(non-optimal) 


(non-optimal) 


(optimal) 


(optimal) 


1 


19 


0.3 


19 


9125 


2 


19 


4.5 


18 


531 


3 


18 


1.5 


18 


580 


4 


16 


7.3 


16 


23 


5 


19 


0.3 


18 


3243 


6 


18 


1.5 


17 


315 


7 


17 


2.1 


17 


25 


8 


19 


0.1 


18 


4216 


9 


19 


1.1 


18 


2219 


10 


19 


0.1 


18 


1452 



The test platform in Table 3 are the same as that in Table 2. Compared with 
the SAT solver, Cube Explorer using lookup tables is much faster. As shown in 
Table 3, Cube Explorer took at most 8 seconds to find a non-optimal solution 
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whose length is at most 20. In most of the cases, it found a solution immediately. 
However, to find an optimal solution, it is also time-consuming in some cases. 
For example, finding an optimal solution of cube state 1 took 9125 seconds. If 
information on lookup tables can be encoded in CNF, it is possible that the SAT 
solver can outperform Cube Explorer. 

Our encoding follows almost the idea of brute force enumeration. However, 
it is impossible for brute force enumeration to find a solution of length 20 with 
length 12 of the phase 1 maneuvers on a modern PC in a reasonable time, since 
brute force enumeration has to check about 15 12 x 9 10 « 10 23 7 states (note each 
state consists of 48 movable facclets) in the worst case. Hence, the SAT solver 
has its advantage over generalized approaches such as brute force enumeration. 

The main advantage of solving a cube with SAT solvers is that it does not 
need a huge lookup table indicating the distance from the home state, and the 
memory requirement is very less. 10 MB RAM memory is enough. 

7 Conclusions 

This paper is the first to solve Rubik's Cube using a SAT solver. The experimen- 
tal results reveal that our SAT encoding of Rubik's Cube and the improvement 
on the existing SAT solver are effective. Using the improved SAT solver, Rubik's 
Cube can be solved in a reasonable time. We believe that the encoding ap- 
proaches and the ALO solving technique should be useful beyond the planning 
domain such as Rubik's Cube. 

Here many open problem remains. For example, what is the optimal SAT 
encoding of Rubik's Cube? The heuristic approach is frequently used to speed 
up SAT solvers. Can the heuristics information be encoded into a SAT formula? 
Indeed, in this paper, we implemented the SAT encoding of partial heuristic 
information such as the goal state information of phase 1 in two-phase algo- 
rithm. However, we cannot still encode the information on lookup tables (pat- 
tern database of Rubik's Cube) in Kocicmba's algorithm, which is used to prune 
the superfluous search space. When a SAT formula is given, can we exploit logic 
structures on the heuristic information? What two-phase algorithm uses is a 
depth-first search technique. How to encode the depth-first search technique in 
CNF is also a challenge. Although one now has proved that God's number or the 
upper bound on the number of moves for Rubik's Cube is exactly 20, We cannot 
encode yet such a problem into a SAT problem. Nevertheless, in near future, 
we believe that it is possible to perform the SAT encoding of the upper bound 
of Rubik's Cube by extending the current SAT encoding technique of Rubik's 
Cube. 
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